aPPRove: An HMM-Based Method for Accurate Prediction of RNA-Pentatricopeptide Repeat Protein Binding Events
نویسندگان
چکیده
Pentatricopeptide repeat containing proteins (PPRs) bind to RNA transcripts originating from mitochondria and plastids. There are two classes of PPR proteins. The [Formula: see text] class contains tandem [Formula: see text]-type motif sequences, and the [Formula: see text] class contains alternating [Formula: see text], [Formula: see text] and [Formula: see text] type sequences. In this paper, we describe a novel tool that predicts PPR-RNA interaction; specifically, our method, which we call aPPRove, determines where and how a [Formula: see text]-class PPR protein will bind to RNA when given a PPR and one or more RNA transcripts by using a combinatorial binding code for site specificity proposed by Barkan et al. Our results demonstrate that aPPRove successfully locates how and where a PPR protein belonging to the [Formula: see text] class can bind to RNA. For each binding event it outputs the binding site, the amino-acid-nucleotide interaction, and its statistical significance. Furthermore, we show that our method can be used to predict binding events for [Formula: see text]-class proteins using a known edit site and the statistical significance of aligning the PPR protein to that site. In particular, we use our method to make a conjecture regarding an interaction between CLB19 and the second intronic region of ycf3. The aPPRove web server can be found at www.cs.colostate.edu/~approve.
منابع مشابه
Pentatricopeptide repeat proteins involved in plant organellar RNA editing
C-to-U RNA editing has been widely observed in organellar RNAs in terrestrial plants. Recent research has revealed the significance of a large, plant-specific family of pentatricopeptide repeat (PPR) proteins for RNA editing and other RNA processing events in plant mitochondria and chloroplasts. PPR protein is a sequence-specific RNA-binding protein that identifies specific C residues for editi...
متن کاملA Combinatorial Amino Acid Code for RNA Recognition by Pentatricopeptide Repeat Proteins
The pentatricopeptide repeat (PPR) is a helical repeat motif found in an exceptionally large family of RNA-binding proteins that functions in mitochondrial and chloroplast gene expression. PPR proteins harbor between 2 and 30 repeats and typically bind single-stranded RNA in a sequence-specific fashion. However, the basis for sequence-specific RNA recognition by PPR tracts has been unknown. We ...
متن کاملIn silico investigation of lactoferrin protein characterizations for the prediction of anti-microbial properties
Lactoferrin (Lf) is an iron-binding multi-functional glycoprotein which has numerous physiological functions such as iron transportation, anti-microbial activity and immune response. In this study, different in silico approaches were exploited to investigate Lf protein properties in a number of mammalian species. Results showed that the iron-binding site, DNA and RNA-binding sites, signal pepti...
متن کاملImproved Computational Target Site Prediction for Pentatricopeptide Repeat RNA Editing Factors
Pentatricopeptide repeat (PPR) proteins with an E domain have been identified as specific factors for C to U RNA editing in plant organelles. These PPR proteins bind to a unique sequence motif 5' of their target editing sites. Recently, involvement of a combinatorial amino acid code in the P (normal length) and S type (short) PPR domains in sequence specific RNA binding was reported. PPR protei...
متن کاملElucidation of the RNA Recognition Code for Pentatricopeptide Repeat Proteins Involved in Organelle RNA Editing in Plants
Pentatricopeptide repeat (PPR) proteins are eukaryotic RNA-binding proteins that are commonly found in plants. Organelle transcript processing and stability are mediated by PPR proteins in a gene-specific manner through recognition by tandem arrays of degenerate 35-amino-acid repeating units, the PPR motifs. However, the sequence-specific RNA recognition mechanism of the PPR protein remains lar...
متن کامل